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LECTURE 4 
SUBSTITUTION WITH VARIANTS 
Part III 
MULTILITERAL SUBSTITUTION 


SUMMARY 


Welcome back from the Thanksgiving holiday break. The good news is that this lecture will come to you about Christmas, therefore, no homework. The not so good 
news is that this concluding Lecture 4 on Substitution with Variants covers some difficult material of wide practically in the field. 


In Lecture 4, we complete our look into English monoalphabetic substitution ciphers, by describing multiliteral substitution with difficult variants. The Homophonic and 
GrandPre Ciphers will be covered. The use of isologs is demonstrated. A synoptic diagram of the substitution ciphers described in Lectures 1-4 will be presented. 


MULTILITERAL SUBSTITUTION WITH MULTIPLE-EQUIVALENT CIPHER ALPHABETS - 
aka "MONOALPHABETIC SUBSTITUTION WITH VARIANTS" 


Each English letter in plain text has a characteristic frequency which affords definite clues in the solution of simple monoalphabetic ciphers. Associations which individual 
letters form in combining to make up words, and the peculiarities which certain of them manifest in plain text, afford further direct clues by means of which ordinary 
monoalphabetic substitution encipherments of such plain text may be readily solved. [FR1] 


Cryptographers have devised methods for disguising, suppressing, or eliminating the foregoing characteristics in the cryptograms produced by methods described in 
Lectures 1-3. One category of methods called "variants or variant values" is that in which the letters of the plain component of a cipher alphabet are assigned two or more 
cipher equivalents. 


Systems involving variants are generally multiliteral. In such systems, there are a large number of equivalents made available by combinations and permutations of a limited 
number of elements, each letter of the plain text may be represented by several multiliteral cipher equivalents which may be selected at random. For example, if 3- letter 


combinations are employed as multiliteral equivalents, there are 26? or 17,576 available equivalents for the 26 letters of the plain text. 


They may be assigned in equal numbers of different equivalents for the 26 letters, in which case each letter would be representable by 676 different 3 letter equivalents or 
they be assigned on some other basis, for example proportionately to the relative frequencies of the plain text letters. [FR1] 


The primary object of substitution with variants is again to provide several values which may be employed at random in a simple substitution of cipher equivalents for the 


plain text letters. 


As a slight diversion, the reader may ask about uniliteral substitution with variants. It is but not very practical. Note the following cipher alphabet constructed in French by 
Captain Roger Baudouin in reference [BAUD]: 


Phin [afo [e d|e f|g|h|i[1 mnjopp[a|r[s tv v|x | 


кечер а аніс ма pine evil 
Iv 


(Note that the Captain was not an ACA member. The Н=Н combination is not allowed.) 


Baudouin proposed that the J and Y plain be replaced by I plain and K plain by C plain or Q plain and W plain by VV plain. Four cipher letters would be available as 
variants for the high- frequency plain text letters in French. 


Mixed alphabets formed by including all repeated letters of the key word or key phrase in the cipher component were common in Edgar Allen Poe's day but are 
impractical because they are ambiguous, making decipherment difficult; for example: 


Enciphering Alphabet: 


Phin [a [o| e [дег вір k 1|m|n o]p|a r|s саум х/у 2 


[cipher [N [ow [1 [s [rH efre [o [кА |o o[o]o [D M E]N [T 


Inverse form for deciphering: 


ICipher [А (В C [D|E|F |G|H|1 кум о [ео [в (тоу |ху (2: 


"Рап |р ivihimis|gid кај [ое [e 


i kih В 


х |t | 
n 


The average cipher clerk would have difficulty in decrypting a cipher group such as TOOET, each letter having 3 or more equivalents, from which plain text fragments (n) 
inth, ft thi(s), it thi, etc. can be formed on decipherment. [FR1] 


THEORETICAL DISTINCTIONS 


In simple or single- equivalent monoalphabetic substitution with variants, two points are evident: 
1) the same letter ofthe plain text is invariably represented by but one and always the same character or cipher unit of the cryptogram. 
2) The same character or cipher unit of the cryptogram invariably represents one and always the same letter ofthe plain text. 

In multiliteral - equivalent monoalphabetic substitution with variants, two points are also evident: 
1) the same letter ofthe plain text may be represented by one or more different characters or cipher units of the cryptogram. But, 


2) The same character or cipher unit of the cryptogram nevertheless invariably represents one and always the same letter of the plain text. 


SIMPLE TYPES OF CIPHER ALPHABETS WITH VARIANTS 


УМ ХЈУ |Z 

lo|R[s |T|U 
* JE *[* * *[* * 
lL Е [alae chle 
м|с[в |+ | а H|U Kk 
NIB|c|* [|м [о/р 
Го [1 [p*lo[R [5 [T|u 
P |к[Е|+[у|м|х|у [2 


WIx|v|z. 

о[в [5 [rlu 

L|M|N|o|P 

IF [cH | | 

А|в [c [рје 

ck pk |k |k | 

ГУ То. ТЕ ГАТА ГВ ciple 
WIR M|G|B|* F [о [H|u |K 
Ix Nis [H|c|*/L|M|N|o|P | 
v[r|o|t|p|*|o[n |s |T|u 
z ufe k[E|*Iv|wIx|v|z 


Figure 4-5 
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The matrices in Figures 4 - 1 to 4- 10 represent some of the simpler means for accomplishing monoalphabetic substitution with variants. The matrices are extensions of the 
basic ideas of multiliteral substitution presented in Lecture 3. 


The variant equivalents for any plain text letter may be chosen at will; thus, in Figure 4-1, е= 10, 15, 60, or 65; in Figure 4-2, e- AU, AZ, FU, FZ, LU or LZ. 


Encipherment by means of matrices shown in Figures 4-2, 4-3, 4-6 is commutative. The coordinates may be read row by column or visa versa. There is no cryptographic 
ambiguity. The remaining matrices are noncommutative. The general convention is to read row by column. 


In Figures 4-5 and 4-6, the letters in the square have been inscribed in such a manner that, coupled with the particular arrangement of the row and column coordinates, the 
number of variants available for each plain text letter is roughly proportional to the frequencies of the letters in the plain text. Figure 35 incorporates a keyword on top of 
this idea. [FR1] 


HOMOPHONIC 

The Homophonic Cipher is a simple variant system. It is a 4-level (alphabets) dinome cipher. Consider Figure 4-11. 
Figure 4-11 

А |B|c [р [е [c [HR [v | |t |M|N 
гов [09/10/11 [12 [13 [14 15 16 |17 [18 19 |20 
35 [36 [37 [38 [39 [40 [41 |42 |43 |a4 |45 [46 |47 
68 [69 |70 [71 [72 [73 [74 [75 [51 [52 |53 [54 [55 
[87 (вв [89 [90 [91 [92 [93 [94 [95 [96 |97 [98 [99 
Го [р [о [2 [5 [т [0] УМ] Х |v [2] 

21 |22 [23 [24 [25 [01 [02 [03 [04 [05 [06 [07 


г гг гг | | г и Pr ee ee 


148 |49 |50 |26 |27 |28 |29 |30 |31 |32 |33 |34 


e sr|ss 5» eo et [sz 60/965 6667 


[00 [76 |77 |78 [79 [80 [81 |82 |83 [84 [85 |86 


The keyword TRIP is found by inspecting dinomes 01, 26, 51, and 76. (The lowest number in each of the four sequences.) [FR1] [FR5] 


The Russians added an interesting gimmick called the Disruption Area. Consider Figure 4- 12 and note the slashes under U - X for the fourth level of dinomes. The famous 
VIC cipher used this feature very effectively. [NIC4] 
Figure 4-12 


А |в [с [ре [е [clafi |» |K |t |M 
[14[15[16[17[18 [19 [20 |21 [22 [23 [24 [25 [26 
27 28 |29 |30 [31 (32 [33 [34 |35 [36 [37 [38 [39 
[58 |59 [60 [61 [62 [63 |64 |65 [66 [67 [68 |69 [70 
[81 [82 [83 [84 [85 [86 [87 [88 [89 [90 [91 [92 [93 
мјојР [о [к [5 | т [о] У | у] Х | 7] 2 
01 02 [оз [04 [05 [06 [07 [08 [09 [10 11 12 [13 
[40 [41 [42 [43 [44 45 |46 [47 [48 |49 [50 [51 [52 
72 |73 [74 53 [54 |55 
95 |96 [97 "2n 


The keyword NAVY is represented by dinomes 01, 27, 53, and 79. 


Security for Homophonic systems is greatly improved if the dinomes and the four sequences are assigned randomly. However, the easy mnemonic feature of the 
keyworded four sequences is lost. 


The Mexican Cipher device is a Homophonic consisting of five concentric disks, the outer disk bearing 26 letters and the other four bearing sequences 01-26, 27-52, 53- 
78, 79-00. The cipher disk enhances frequent key changes. Figure 4-12 shows the matrix without the disruption area. [FR5] [NIC4] 


HOMOPHONIC CRYPTANALYSIS 


Lets solve the following cryptogram. 


68321 09022 48057 55111 88648 42036 45235 09144 
05764 22684 00225 57003 97397 14074 82524 40768 
51058 93074 92188 47264 09328 04255 06186 79882 
85144 45886 32574 55136 56019 45722 76844 68350 
45219 71649 90528 65106 11886 44044 89669 70553 
18491 06985 48579 33684 30957 70612 09795 29148 
56109 08546 62062 65509 32800 32568 97216 44282 
34031 84989 68564 53789 12530 77401 68494 38544 
11368 87616 56905 20710 58864 67472 22490 09136 
62851 24551 35180 14230 50886 44084 06231 12876 
05579 58980 29503 99713 32720 36433 82689 04516 
52263 21179 06445 72255 68951 86957 76095 67215 
53049 08567 9730 


Assuming we did not know that the above cryptogram was a HOMOPHONIC, we might make a preliminary analysis to see if we are dealing with а cipher or а code. 
We will cover code systems later in the course, but a few introductory remarks might be in order. The five letter groups could indicate either a cipher or a code. 


Ifthe cryptogram contains an even number of digits, as for example 494 in the previous message, this leaves open the possibility that the message is a cipher containing 
247 pairs of digits; were the number of digits an exact odd multiple of five, such as 125, 135, etc., the possibility that the cryptogram is in code of the 5-figure group type 
must be considered. 


We next study the message repetitions and what their characteristics are. If the cipher text is of 5-figure code type, then such repetitions as appear should generally be in 
whole groups of five digits, and they should be visible in the text just as the message stands, unless the code message has been superenciphered. If the cryptogram is a 
cipher, then repetitions should extend beyond the 5 -digit groupings; if they conform to any definite at all they should for the most part contain even numbers of digits since 
each letter is probably represented by a pair (dinome) of digits. 


We start with 4-part frequency distribution. We next assume a 25 character alphabet from 01-00. This is the common scheme of drawing up the alphabets. Breaking the 
text into dinomes (2-digit) pairs yields: 
от [02 [оз [04 [05 | 06 | 07 [os| o9 | 10 | 11 [12 13| 14 [15 |16|17| 18 [19 | 20 |21 22 [23 |24 |25 


| [и] [ж и шл [ж m rr и [жш |/ p pmo р 


26 [27 [28 [2930 | 31 | 32 [33 34 | 35 [36 [37 |38 | 39 | 40 |a1 [42 | 43 | 44 | 45 |46| 47 [ав [49 | 50 
Hi pep [ж |, o wr pr pw) wor mnm | уу 


| 51 |52 |53 |54| 55 | 56 | 57 |58 | 59 | 60 |61 |62|63| 64 | 65 166/67] 68 | 69 | 70 |71| 72 |73 74 | 75 


| 76 |77 |78 |79|80 | 81 | 82 |83| 84 | 85 |86 |87 [88 | 89 | 90 |o1|o2| 93 | 94 | 95 [96 | 97 [98 | 99 | 00 


What we have before us are four simple, monoalphabetic frequency distributions similar to those involved in a monoalphabetic substitution cipher using standard cipher 
alphabets. The next step is to fit the distribution to the normal. Since I=J for the 25 letter alphabet, we find that the Keyword is JUNE and the following alphabets result: 


01 Isg 26 U $3. N 76 E 
02 K 2y УМ 52 0 17 PF 
03 L 28 W 22? 78 С 
04 M 29 X 54 Q 79 H 
05 N 30 Y 55 R 80 IJ 
06 0 31 Z 56 5 81 K 
07 P 32 А oo X 82 L 
08 о 33 В 28 U 83 M 
09 R 34 € 59 v 84 N 
108 S 35 D 60 W 85 Q 
ii T 36 E 61 X 86 P 
12 UU ot P 62 Y 87 Q 
i$ У 38 G 63 -Z 88 R 
14 W 39 H 64 A 89 6 
15 Ж 40 IJ 55 B so T 
16 Y 41 K 66 C 91 U 
IT Ww 42 L 67 D 92 ү 
18 А 43 M 68 E 93 W 
19 B 44 N 69 F 94 X 
20 С 45 О T0 © 39 Ж 
21 Q 46 P TL Н 36 z 
22 E 47 Q 72 XJ 97? A 
23 F 48 R 73 К 98 B 
24 G 49 5 74 L 99 С 
25 H 5D T 39. M 00 D 


The first groups of the cryptogram decipher as follows: 
(вв [32 [10 90 [22 fas [os [76 [s1 [1t [88 [64 [84 [20 [36 45 [23 
е а |» [а е1 а е а [е [5 [а Га ее [ое 


If a 26-element alphabet were used only the distribution analysis would have been changed to be on a basis of 26, ће process of fitting the distribution to ће normal 
would be the same. 


PLAIN COMPONENT COMPLETION METHOD 


Suppose we know that two correspondents have been using the same variant system as in the previous Homophonic. The message intercepted is: 


48226 88423 52099 93604 76059 05651 36683 52267 


97114 54466 76 


A variation of the plain-component completion method can be used to crack the new message. We copy the message into dinomes and separate by levels. 


— 


48 |22 |68 [84 |23 |52 [09 [99 [36 [04 [76 [05 |90 [56 |51 [36 |68 |35 22 |67 [97 [11 |45 |44 |66 |76 
2 1 [s [4 1 [3 1 [4[2 [ 1 [a [n [a [3 [3 2 3 [2 1 ]3 [4] 1] 2] 2] 3] 4 


) 22 23 09 04 05 22 11 
) 48 36 36 35 45 44 

(3) 68 52 56 51 68 67 66 
) 84 99 76 90 97 76 


These dinomes are converted into terms of plain component by setting each of the cipher sequences against the plain component at an arbitrary point of coincidence, such 
as the following: 


А |в [с [р (е [е [c [B [v |K |t |M|u [o] P |o|R|s [T[v|v |w|x |v |z 
[01 [02 [оз [04 [05 [06 [о7 [08 [oo [10 [11 [12 [13 [14 [15 [16 [17 |18 [19 [20 [21 [22 [23 [24 [25 
26 (27 [28 |29 |30 |31 [32 133 [34 [35 [36 [37 [38 [39 [40 |41 |42 Jas [44 |45 [46 [47 [48 [49 [50 
51 [52 |53 [54 [55 [56 57 |58 [59 [60 [61 [62 [63 [64 [65 [66 [67 [68 [69 [70 |71 |72 |73 [74[75 


76 [77 [78 [7980 [81 |82 [8s ви [8s ве [57 |в [89 90 [91 |92 [55 94 [95 [96 [97 98 [59 [oo 


) 22=W; 23=Х; 09=1; 04-D; 05-E; 22-W; 11=L 
) 48=X; 36=L; 36-L; 35=K; 45-U; 44=T 
(3) 68=S; 52=B; 56=Е; 51=A; 68=5; 67-R; 66=Q 
) 84-21; 99=Y; 76=А; 90=P; 97-W; 76=А 


This method works because both the plain component (A,B..) and the cipher component (01, 02..) are known sequences. 


The plain-component sequence is completed on the letters of the four levels by Caesar Rundown, as follows: 
Level 1 | Level 2 | Level 3 Level4 


WXIDEWL |XLLKUT |SBFASRQ |IYAPWA 
XYKEFXM |YMMLVU TCGBTSR | |KZBQXB 
YZLFGYN |ZNNMWV|UDHCUTS |LACRYC 
ZAMGHZO |AOONXW [VEIDVUT | |MBDSZD 
ABNHIAP |BPPOYX |WFKEWVU |NCETAE 
BCOIKBQ |CQQPZY |XGLFXWV |ODFUBF 
CDPKLCR |DRRQAZ |YHMGYXW |PEGVCG 
DEQLMDS [ESSRBA [|ZINHZYZ  |QFHWDH 


EFRMNET |FTTSCB |AKOIAZY  |RGIXEI 
FGSNOFU |GUUTDC |BLPKBAZ |SHKYFK 


GHTOPGV [HVVUED |CMQLCBA [TILZGL 
HIUPQHW |IWWVFE |DNRMDCB [UKMAHM 
IKVQRIX |KXXWGF |EOSNEDC | VLNBIN 
KLWRSKY |LYYXHG |FPTOFED [WMOCKO 
LMXSTLZ [MZZYIH |GQUPGFE |XNPDLP 
MNYTUMA[NAAZKI [HRVQHGF |YOQEMQ 
NOZUVNB |OBBALK |ISWRIHG  |ZPRFNR 
OPAVWOC[PCCBML [KTXSKIH | |AQSGOS 
PQBWXPD |QDDCNM |LUYTLKI |BRTHPT 
QRCXYQE [REEDON |MVZUMLK |CSUIQU 
RSDYZRF |SFFEPO [NWAVNML |DTVKRV 
STEZASG |TGGFQP |OXBWONM |EUWLSW 


TUFABTH |UHHGRQ |РҮСХРОМ |FVXMTX 


[UVGBCUI | VIIHSR |QZDYQPO IGWYNUY 
VWHCDVK|WKKITS |RAEZRQP |HXZOVZ 


The generatrices with the best assortment of high frequency letters for the four levels are: 
Level 1 Level 2 Level 3 Level 4 


EFRMNET REEDON EOSNEDC NCETAE 


Arranging the letters of these generatrices in order of appearance of their dinome equivalents, according to levels we have: 
‘as 22 [68 [84 [23 [52 [о 9 [зе [04 [76 [os [so [56 [51 [5e Jes [35 [2:[67 [97 
E 


NN BENIN [| Р| | | 


— 


The plain text reads "Reinforcements needed a[t once]". 
Looking at the equivalents 01,26, 51, 76 we reveal the keyword JUNE. 


In evaluating generatrices, the sum of the arithmetic frequencies of the letters in each row may be used as an indication of the relative "goodness". A statistically better 
procedure uses the logarithm of the probabilities ofthe plain text letters forming the generatrices. See [FR2] 


The Homophonic is a popular cipher and has been discussed in several issues of The Cryptogram as well as LEDGES' NOVICE NOTES. See references [HOMI - 
HOM6] and [LEDG]. 


For our computer bugs, TATTERS Homophonic solver is very easy to use and available on the Crypto Drop Box. 


MORE COMPLICATED TYPES OF CIPHER ALPHABETS WITH VARIANTS 
GRANDPRE 


Consider the cipher matrices shown in figures 4-11 to 4-13. These are called frequential matrices, since the number of cipher values available for any given plain text letter 
closely approximates its relative plain text frequency. 


Figure 4-11 

A|B [c |p|E WIx|v 
А |“ [та |A |v|R. [E [c [A 
в [**[s |.|: [E |v. в Nis 
c [*[*[с[м[р[ом un 
Ip |**R[A P |T|F- [ү fs [o 
Еј] x|N[E E 


|-- 
m] 
> |с|а|ш|=] [ој < | az] =] У] N 


ja> |о| = 


Heiek [oTeTe]m 7 Tx] 


( 676 - cell matrix ) 


In figure 4-11, the number of occurrences of a particular letter within the matrix is proportional to the frequency in plain text; the letters are inscribed in random manner, in 
order to enhance the security of the system. 


Figure 4-12 

e[8 | 1 [5 [4 [3 17 |2 |o 
(|+ [+ АГА [А [с [p |E|E [1 [г [ч 
H[*I*lA[A[c[p [E |E]H K|N [o 
(з [++ А вре [E [ніз [NJo [к 


8|*|*|A|D|E ЈЕ ЈН I |NJOIR |S 
[9[* [* [СЕ ЈЕ [а |: |NIo|R|s [т 
;| “ЕВЕ ЈЕ l [м[о[о[5|т [т 
lof- lE ЈЕ |1 Mlo P ЈЕ [T [T [0 
5[*[*[Е [т LI fe ЈЕ |5 |T|u x 
6k [хр R (8 |т |jv|w|v 
[4l*I*ILIN[ol [s [т/м [7 


In figure 4-12, the same idea as 4-11 is presented in reduced form from 26 x 26 to 10 x 10. The letters have been inscribed by a simple diagonal route, from left to right, 
within the square, and the coordinates scrambled by means of a key word or key number. 


Figure 4-13 illustrates the famous Grandpre Cipher; in this square ten words are inscribed containing all the letters of the alphabet and linked by a column keyword 
("equivalent") as a mnemonic for inscription of the row words. ACA literature also covers this cipher. See references [LEDG] and [GRAL - 3] for solution hints for the 
Grandpre cipher. 


SACCO 


General Luigi Sacco proposed a frequential-type system that uses both enciphering and deciphering matrices. The inscribed dinomes were completely disarranged by 
applying a double transposition to suppress the relationships between letters. References [SACC] and [FR1] both give a good description of the process. The number of 
variant values in this system are reflective of the Italian language. 


BACONIAN 


The Baconian ciphers found in the Cryptogram are a variant system. The "a" elements may be represented by any one of 20 consonants as variants, while the "b" elements 
may be represented by any one of 6 vowels; or the letters A-M may be used to represent the "a" elements and the letters N-Z for the "b" elements; digits may be used for 
either the "a" or "b" elements, either on the basis of first five or last five digits, or odd versus even digits, or the first 10 consonants (B-M) and the last 10 consonants (N-Z) 


SUMMING-TRINOME 


Friedman describes a complex variant known as the summing- trinome system. Each plain text letter is assigned a value from 1-26; this value is expressed as a trinome, the 
digits of which sum to the designated value of the letter. The letter assigned the value of 4 may be represented by any of 15 permutations and combinations. Friedman 
discusses further ways of complication including disarrangement, addition of punctuation and nulls. See [FR1] pages 109-110. Note the inverted normal distribution 
representation of this cipher. 


ANALYSIS OF A SIMPLE VARIANT EXAMPLE 


The following cryptogram is available for study: 


QMDCV|PLFNF |DHNWJ WLKDKJ|NHBPV 


RLTVM [BKLWD|WVHVK|SHBCL [PQKJR 
VWSML|KGCNR [LRNKV |MGFXW|JRGMV 
WGTJH |QKXFN ZVFDM |LTBPL [PVFLM 
DCNWN[HBCVZ |NMLWQ|FDHDW|VZBRV 
KLCVC |vRDHL [RVTLF [NCDKG|MXWXM 
DTSCB [CLZLR |LMvTS |ZNKBW|VPBRN 
CLRXR |DCNKV|PBTNT |GHJZL |ЕОЕУК 


BWDZX |PNHSP |GHLKL |F VZLT |[VMLKD 


PQRNZ |LZDTB |MNTGM NZVFX |KSFDC 


LZVTV [FDFVR |GCLPQ |PNCDW|VRJTN 


HLZLM |VWNPV PDZDW JPNWL RJKVM 


XMDTS |MGFDR |DKLWJ |FLPJM |SFQWB 
FNCBZ [DKVWG|ZSHBH |DHJCX 


Note the total absence of A, E, I, O, U, and Y. Remarkable and definitely nonrandom event. Since a uniliteral substitution alphabet with 6 letters missing is highly unlikely, 
the next guess is we are dealing with a ти ега! substitution. Closer inspection shows that ten consonants are initials (B D G J L N Q S V X) and the remaining ten 
consonants are used as terminals (C F H K M P R T W Z). This implies both bipartite and biliteral character. 


We construct a digraphic distribution: 


C[F[HIK|M|PIR[T[w|z 
в[*[з[1[1[1[1 2212 

реја {1/3 [3 [1 [1[1[3[4 [2 
[с[*[2 2/2 [о [3 folo | [о [г 
3h fafa fa fa fafa fifi | 
1.[*[1 falo а [з fals [3 [3 [4 
| з ida b 
о 20 2 [1 hhh [1 
Is [212 [0 [2 fa [olofo [г 
vil lh [3 [4 la ble [3 
x[*lo f [о [1 [2 1 1 [o[2 Jo 


We assume the use of a small enciphering matrix with variants for rows and columns. We assume that the various possible cipher variants are of approximately equal 
frequency; the column indicators pair equally often with the row indicators of the enciphering matrix. We look for similar row profiles and column profiles. We match first 
the rows and then the columns. 


Row L and V distributions have pronounced similarities. They are "heavy" in their frequency distributions in the same places. So are rows D and N. They have 
homologous attributes in appearance. 


Finding the next rows are not obvious. We use a "goodness of match" procedure to equate interchangeable variants. We calculate the cross- product sums for each trial. 
The next heavy row is G. We test G against the remaining rows. 


о [ppt 


в |*+з1[1[1[2[2[1[2[1 


св "јер [о porro 


We compare the balance of rows: 


G*B |+|6|2 [2o [oo 1|o1 [-15 
c [+[22[2[о[з3[о[о[1[о[1[=11 
G*Q [--lo|4|o o[3 oo 
a*s |+ 2 |4[4o|e[o]o 
с*х |+[0|2 [00 o|e o 


The results are most probably match G and S. 


The next heaviest row is B. Testing against the remaining three rows we have: 


The correct pairings are B with J and Q with X. Since we have not found more than two rows for any one set of interchangeable values the original matrix has only five 
rows. 


c[F[H/K[m|P|R[T|w|z 
Bh 422] 1 ii 
Гр [хв [2 [8 [7 [2 [2/2 [s [7 [5 
Gls [з [4 [4 [о [5 [1 [0]1 [0 [2 
IL [у [о в (7 | Islo le [7 [7 
ојХ [о [з lo (з [3 2 [2 [о/з [о 


Values represent the sums of the combined rows. We apply the same process to matching columns. C and Н are a matched pair. F with M and P with R. We use the 
cross product sums for the balance of the columns. 


K*T+ |4 [35 [42 |- 
кауч [4 [49] 49]9. [111 
ez јаја 


T*w« |6 [35 [42 |- [83 


т*7+ |4|25|2 [42]73 | 


w*z+ [6 [35 [49 |- |90 


KT, WZ«|81. |-|oo- 171 


култе m s - se 


капи ве аст 


We would expect that the proper pairings are K with W and T with Z. 


в [6 [4 [5 [7 [4 [Pp [1962 
D|N|16|4 |14]4 |10|P- [1132 
Gls 7 [9 |- [1 [з [РНцо)=[1670 
[з рат 


olx|- le [6 |- | 


The converted cryptogram is solved via the principals in Lecture 2 and Lecture 3. The beginning of the message reads Weather forecast. The original keying matrix is 
recovered with a keyword of ATMOSPHERIC. 


The method of matching rows and columns applies equally well for all the matrices shown previously. It is key to start with the best rows and columns from not only 
heaviness standpoint but the distinctive crests and troughs. A second key is the low frequency letters. No variant system can adequately disguise low frequency letters and 


they will have the same frequency in the cipher text. Friedman describes a more general solution to variant analysis. [FRE1, p119 ff] 


Chapter 10 of reference [ЕКЕ1] covers the disruption process associated with monome-dinome alphabets of Irregular- Length cipher text units. Figures 4-14 and Figure 
4-15 show enciphering matrices where the encipherment is disrupted and commutative. The normal row conventions are used to encipher except when the row indicator 
was the same for the immediately preceding letter. In Figure 4-14, EIGHT could be encrypted 10 29 7 8 49 and then rearranged into standard groups of 5 letters 
(numbers). In Figure 4-15, E — 24 or 42, T — 621 or 162. Figure 4-16 is an example of the Russian disruption process added for security. 


ISOLOGS 


Cryptograms produced using identical plain text but subjected to different cryptographic treatment, and yielding different cipher texts are called isologs. (isos = equal and 
logos = word in Greek). Isologs are usually equal or nearly equal in length. Isologs, no matter how the cryptographic treatment varies, are among the most powerful tools 
available to the cryptanalyst to solve difficult cryptosystems. 


Take two messages A and B suspected of being isologs and write them out under each other. We then examine the similarities and differences. Assume the messages both 
start with "Reference your message..." I will arrange the messages in a special table to facilitate the study. 


Group No. 
E 10 15 

A 82 26 56 31 03 74 83 96 98 42 32 52 97 01 15 
A' 30 15 08 74 97 14 51 19 73 60 49 67 65 01 06 
B 8027 78 91 06 94 00 01 38 28 54 08 24 00 65 
в" 45 64 79 51 81 69 67 25 38 89 41 56 32 52 03 
© 63 62 93 39 16 43 15 89 10 49 26 45 84 50 39 
E 50 62 87 75 36 20 35 11 05 70 99 27 77 50 11 
D 81 71 35 25 38 73 30 92 07 49 61 75 21 64 76 
0" 35 19 99 01 38 99 97 45 02 32 04 11 58 92 16 
E 38 72 89 11 47 99 92 64 14 68 13 36 53 38 81 
E' 38 46 31 75 47 14 64 80 06 46 85 86 45 38 98 
F 69 69 79 38 16 51 75 05 70 74 11 80 44 32 55 
p? 26 12 18 39 78 94 89 93 37 28 11 27 22 05 04 
G 29 12 02 77 30 31 19 97 99 62 27 86 56 06 53 
G" 06 48 43 21 03 98 71 54 26 62 80 76 08 98 80 
H 80 87 04 09 67 46 59 41 98 55 10 82 22 29 87 
H’ aa 10 55 29 00 59 72 B2 28 55 87 30 07 Oe 93 
J 46 72 93 62 45 

J' 59 68 24 62 53 


The dinome distributions for these two messages are as follows: 


Message A | Message B 


Both distributions are too flat - no crests or troughs. We assume a variant system of a monoalphabetic cryptosystem. [FRE3] shows us how to use a Poisson exponential 
distribution to evaluate random text. The gist of the statistics is that the expected number of blanks is too low. The chi test indicates extreme non randomness for both 
messages. The chi test applied to both distributions implies that they both have been enciphered by the same cryptosystem because there exists a close correlation 
between the patterns of the two distributions. [FR1, p123} discusses the potentialities of the cryptomathematics as a supporting science to cryptography. 


There are several identical values between the messages. This implies that not only has the same cryptosystem been used but also the same enciphering matrix. The values 
38 and 62 must represent very low frequency letters because no variants are even provided for this letter. 


We now form isolog chains between the messages. 


(06 14 15 26 28 31 35 73 74 81 89 98 99) 
(02 07 20 22 43 44 63 90) 

(12 37 АВ 51 69 70 83 94) 

(03 30 41 54 65 82 97) 

(05 10 24 32 49 87 ЭЗ) 

(16 18 36 76 78 79 86) 

(27 45 53 64 80 92) 

(11 39 75 88) 

(21 58 77 84) 

(46 59 68 72) 

(00 52 87 
(04 55 61 
(08 29 56 
(19 71 96 
(01 25) 
(15. 85) Single Dinomes: 

(42 60) (38) (47) (50) (62) (31) 


These chains of cipher values represent identical plain text pairs. Beginning with the first value in the message 82 and 30 a partial chain of equivalent variants is formed; 
now locating the other occurrences of either value we note the value that coincides with it in the other message. We therefore extend the chain. The plain text values are 
arbitrarily fit into 10 x 10 square: 


1 BHHEEBRA = ACO 
2 4. ТТ = OME SE. МЕ 
3.5 OQO-=$-EHAN BB RB 
4eR2Y TES LY МЮ = 
С QS RPE = TLX 
6 . PUTS Ета МҮ 
7 .CLEEDAIAAN 
B&B БЕХЕНАОВЕ5 
9.G8SOQON -CREET 
C2. МОТ ВРО ЕТЕ 10 


Manipulating ће rows and columns with а view to uncovering ће keys or symmetry, we find a latent diagonal pattern without keyword. We set up the following 
enciphering matrix: 


7 .AÀABACDEEIILN 
1.AACDEEHKNO 
$ . A B DER AON OR 
8. ADEEHINORS 
9 .CEEGINORSI 
2 BERFEPM OG Ss TT 
G.EBFIMOP RE TO 
Se FLL PRS Fuss 
6 . Lith PRS TO WY 
4. LORS T PV YS 


I can not over emphasize the value of isologs. The value goes far beyond simple variant systems. Isologs produced by two different code books or two different 
enciphered code versions of the same plain text; or two encryptions of identical plain text at different settings of a cipher machine, may all prove of inestimable value in the 
attack on a difficult system. 


SYNOPTIC CHART OF CRYPTOGRAPHY PRESENTED IN LECTURES 1 -5 


Cryptograms 
Cipher Code Enciphered Code 
Substitution Transposition Combined 
Substitution = 
Transposition 
Monoalphabetic Multiple- Polyalphabetic 


Alphabetic 


Systems 


МЕЗЕ ЊЕ. „рее ONERE веет ен Multiliteral 


Standard ... Mixed 
Alphabets Alphabets 
Keyword ... Random 
Mixed Mixed 
Single Equivalent Variant бини ње 
Fixed Length Mixed Length 
Cipher Groups Cipher Groups 


Biliteral...N-literal $ 5 
Monome -Dinome Others 


Matrices with Non Bipartite 
Coordinates 
(Bipartite) 


Here is the tentative plan for the balance of the course. Just a plan - subject to revision. 


LECTURES 5 - 7 


We will cover recognition and solution of XENOCRYPTS (language substitution ciphers) in detail. 


LECTURES 8 - 12 


We will investigate and crack Polyalphabetic Substitution systems. 


LECTURES 13 - 18 


We will investigate and crack Cipher Exchange and Transpositions problems. 


LECTURE 19 
We will devote this lecture to International Law. 
LECTURES 20 - 23 
We will walk through the mathematical fields to solve Cryptarithms. 
LECTURES 24 - 25 


We will introduce modern cryptographic systems and field special topics. We will do a primer on PGP. 
SOLUTIONS TO HOMEWORK PROBLEMS FROM LECTURE 3 


Thanks to JOE-O for his concise sols. 


Mv-1. From Martin Gardner. 
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I presented Ms 
alphabetic code that I've never seen before. Maybe you can use it 


1 in a strange format. It fooled some but not all. The Key is 01=1=а, 02=2=,...26=7. the alphabet is standard. Message reads: " Here's a simple 
in you column. Martin Gardner, Hendersonville, North Carolina. 


Solve and reconstruct the cryptographic systems used. 
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Divide the original 1 m pn p ; пош fees а - — with 0,1, or 2 and ended with 0 - 9. Construct a matrix similar to Figure 3-2. (3 x 10) Fill in the matrix 
‚ dinomes to letters. = the Phi test and пе mon- easi cu hae icy, VOC count, 


Message reads:Reconnoiter Auys Cayes Bay at daylight seventeen April and then proceed through point George on course three three zero speed twelve period report 
noon pos: т 


Key = NEW YORK, 3 X 10 matrix, Rows 0,1,2, columns 0-9 and 00 blank. 
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Noted all entries were numbered 1-5. Assumed a 5 x 5 matrix filled with a straight alphabet, substituted letters for the dinomes. Used frequency count, contact count and 
phi test to confirm mono-alphabeticity. Identified 8 consonants and 2 vowels. Made the E, T assumption based on frequency. First word dropped as weather. Rest of 
message fell apart with addition of W, A, R to the matrix. 


Message reads: Weather forecast Thursday partly cloudy ... at present about one thousand feet. 


Key - Beginning column 1 - MONDAY, in 5 x 5 matrix. 


My last two problems were taken from reference [OP20] course. 
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